A strategy for cost efficient distributed data storage for in-memory OLAP

نویسندگان

  • Olga Mordvinova
  • Oleksandr Shepil
  • Thomas Ludwig
  • Andrew Ross
چکیده

With the availability of inexpensive blade servers featuring 32 GB or more of main memory, memory-based engines such as the SAP NetWeaver Business Warehouse Accelerator are coming into widespread use for online analytic processing (OLAP) of terabyte data volumes. Data storage for such engines is often implemented in standard storage technologies like storage area network (SAN) or network attached storage (NAS) with high hardware costs. Given the access pattern, storage costs can be reduced by using a distributed persistence layer based on commodity architecture. We discuss an example of an in-memory OLAP engine with a focus on storage architecture. We then present an implementation of a distributed persistence layer that is optimized for the access pattern of such engines. Finally, we show the cost-saving potential and discuss the performance impact compared to SAN systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...

متن کامل

Optimum energy management strategy in smart distribution networks considering the effect of distributed generators and energy storage units

The penetration of distributed generation sources and energy storage units in distribution networks is increasing. Therefore, their impact on the reliability of the network is very necessary. In this study, in order to provide an optimal energy management strategy for smart distribution network, the multi-objective optimization problem of dynamic distribution feeder reconfiguration in the pres...

متن کامل

Improving Data Availability Using Combined Replication Strategy in Cloud Environment

As grow as the data-intensive applications in cloud computing day after day, data popularity in this environment becomes critical and important. Hence to improve data availability and efficient accesses to popular data, replication algorithms are now widely used in distributed systems. However, most of them only replicate the static number of replicas on some requested chosen sites and it is ob...

متن کامل

Power Management in a Utility Connected Micro-Grid with Multiple Renewable Energy Sources

As an efficient alternative to fossil fuels, renewable energy sources have attained great attention due to their sustainable, cost-effective, and environmentally friendly characteristic. However, as a deficiency, renewable energy sources have low reliability because of their non-deterministic and stochastic generation pattern. The use of hybrid renewable generation systems along with the storag...

متن کامل

SISYPHUS: A Chunk-Based Storage Manager for OLAP Cubes

In this paper, we present SISYPHUS, a storage manager for data cubes that provides an efficient physical base for performing OLAP operations. On-Line Analytical Processing (OLAP) poses new requirements to the physical storage layer of a database management system. Special characteristics of OLAP cubes such as multidimensionality, hierarchical structure of dimensions, data sparseness, etc., are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009